A Hybrid Learning Strategy for Discovery of Policies of Action

Authors

Richardson Ribeiro

Fabrício Enembreck

Alessandro L. Koerich

Abstract

This paper presents a novel hybrid learning method and a performance evaluation methodology for adaptive autonomous agents. Measuring the performance of a learning agent is not a trivial task and generally requires long simulations as well as knowledge about the domain. A generic evaluation methodology has been developed to precisely evaluate the performance of policy estimation techniques. This methodology has been integrated into a hybrid learning algorithm whose aim is to decrease the learning time and the number of errors of an adaptive agent. The hybrid learning method, named K-learning, integrates the Q-learning and K-Nearest-Neighbors algorithms. Experiments show that the K-learning algorithm surpasses the Q-learning algorithm in terms of convergence speed to a good policy.
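As a rough illustration of the two ingredients named in the abstract, the sketch below pairs the standard tabular Q-learning update with a k-nearest-neighbors estimate of a Q-value. The abstract does not say how K-learning combines the two, so the combination shown here (averaging Q-values over the k closest visited states) is an assumption made for illustration only, not the authors' algorithm. The function names `q_update` and `knn_q_estimate`, the Euclidean distance, and all parameter values are likewise hypothetical.

```python
import math
from collections import defaultdict


def q_update(q, state, action, reward, next_state, actions, alpha=0.1, gamma=0.9):
    """Standard tabular Q-learning update for one observed transition."""
    best_next = max(q[(next_state, a)] for a in actions)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
    return q[(state, action)]


def knn_q_estimate(q, visited, state, action, k=3):
    """Hypothetical k-NN step: average Q over the k nearest visited states.

    Assumes states are numeric tuples compared by Euclidean distance.
    """
    nearest = sorted(visited, key=lambda s: math.dist(s, state))[:k]
    if not nearest:
        return 0.0
    return sum(q[(s, action)] for s in nearest) / len(nearest)


# Toy usage on a made-up two-action problem.
q = defaultdict(float)          # unseen (state, action) pairs default to 0.0
actions = ["left", "right"]
new_value = q_update(q, (0, 0), "right", 1.0, (0, 1), actions)

visited = [(0, 0), (0, 1), (5, 5)]
estimate = knn_q_estimate(q, visited, (0, 0.5), "right", k=2)
```

One motivation for such a neighbor-based estimate is that it lets an agent reuse values learned in nearby states instead of starting from zero in every unvisited state, which is one plausible route to faster convergence; whether K-learning works this way cannot be determined from the abstract alone.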

Similar resources

On-Line Learning of a Persian Spoken Dialogue System Using Real Training Data

The first spoken dialogue system developed for the Persian language is introduced. This is a ticket reservation system with Persian ASR and NLU modules. The focus of the paper is on learning the dialogue management module. In this work, real on-line training data are used during the learning process. For on-line learning, the effect of the variations of discount factor (γ) on the learning speed...

Reinforcement learning based feedback control of tumor growth by limiting maximum chemo-drug dose using fuzzy logic

In this paper, a model-free reinforcement learning-based controller is designed to extract a treatment protocol because the design of a model-based controller is complex due to the highly nonlinear dynamics of cancer. The Q-learning algorithm is used to develop an optimal controller for cancer chemotherapy drug dosing. In the Q-learning algorithm, each entry of the Q-table is updated using data...

The Impact of Studio-based learning on Metacognition and Design Ability of Architecture Students - Action Research

Proper training can put design learners in the right direction. It also enhances the power of drawing. The objective of this study was to assess the effectiveness of architectural studio-based learning in increasing the drawing power and metacognitive abilities of students. This research seeks to answer these questions: Can architectural studio-based learning increase student design ability? Can architectural st...

Involvement Load of Vocabulary Tasks IELTS preparation Vocabulary Course Books

The importance of vocabulary is undeniable. EFL learners need a sufficient lexicon in order to be competitive speakers. Many strategies have been proposed. The concept of involvement load was first introduced by Hulstijn and Laufer (2001). They believed that deeper explanation of lexical information will result in better retention of it. The present study aimed at finding the...

Publication date: 2006